# Unified Image-Text Representation
Florence 2 Large No Flash Attn
MIT
Florence-2 is an advanced vision foundation model developed by Microsoft, employing a prompt-based approach to handle diverse visual tasks through unified representation, enabling functions like image captioning and object detection.
Text-to-Image
PyTorch
F
multimodalart
73.91k
16
Florence 2 Base Ft
MIT
Florence-2 is an advanced vision foundation model developed by Microsoft, employing a prompt-based approach to handle a wide range of vision and vision-language tasks.
Image-to-Text
Transformers

F
microsoft
56.78k
110
Featured Recommended AI Models